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Zinc-finger nucleases (ZFNs) have proved to be successful tools for targeted genome 
manipulation in several organisms. Their main property is the induction of double-strand 
breaks (DSBs) at specific sites, which are further repaired through homologous 
recombination (HR) or non-homologous end joining (NHEJ). However, for the appropriate 
integration of genes at specific chromosomal locations, proper sites for gene integration 
need to be identified. These regions, hereby named safe harbor loci, must be localized 
in non-coding regions and possess high gene expression. In the present study, three 
different ZFN constructs (pZFNI, pZFN2, pZFN3), harboring p -glucuronidase (GUS) as 
a reporter gene, were used to identify safe harbor loci on rice chromosomes. The 
constructs were delivered into IR64 rice by using an improved Agrobacterium-med'iaXed 
transformation protocol, based on the use of immature embryos. Gene expression was 
measured by histochemical GUS activity and the flanking regions were determined 
through thermal-asymmetric interlaced polymerase chain reaction (TAIL PGR). Following 
sequencing, 28 regions were identified as putative sites for safe integration, but only one 
was localized in a non-coding region and also possessed high GUS expression. These 
findings have significant applicability to create crops with new and valuable traits, since 
the site can be subsequently used to stably introduce one or more genes in a targeted 
manner. 
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INTRODUCTION 

Rice {Oryza sativa L.) has emerged as a model cereal system for 
molecular studies as the complete genome has been sequenced, 
several tools for fonctional genomics are available, and the pro- 
duction of transgenic plants by efficient Agrobacterium-mediated 
transformation is easier than with other major cereals (Izawa 
and Shimamoto, 1996). In addition, rice is one of the leading 
food crops worldwide and increasing rice production is expected 
to play a significant role in reducing hunger and upgrading the 
economic status of developing countries. 

Nowadays, due to recent advances in molecular biology, 
research focuses more and more on the ability to manipulate 
genomes at specific sites. Efficient methods for genome edit- 
ing forther promote gene discovery and functional gene analyses 
in model plants as well as the introduction of novel desired 
agricultural traits in important species. Genome editing technol- 
ogy enables efficient and precise genetic modification through 
the induction of a double-strand break (DSB) in a specific 
target sequence, followed by the generation of desired mod- 
ifications during the subsequent DNA break repair (Puchta, 
2002). Genome editing is achieved by integrating desired DNA 



molecules into the target genome by employing mainly the 
homologous recombination (HR) pathway. However, in plants, 
these molecules are normally delivered by direct gene-transfer 
methods and often integrate into the target cell genome via 
non-homologous end joining (NHEJ) instead of HR (Britt and 
May, 2003). Currently, zinc-finger nucleases (ZFNs), transcrip- 
tion activator-like effector nucleases (TALENs), and clustered 
regulatory interspaced short palindromic repeats (CRISPR)/Cas- 
based RNA-guided DNA endonucleases are used as innovative 
techniques in genome editing (Gaj et al., 2013). These nucle- 
ases diverge in different aspects, starting from the composition, 
to specificity and mutation signatures (Kim and Kim, 2014). 
Knowledge of their specific features is essential for choosing the 
most appropriate tool for a range of applications. 

ZFNs were among the first tools used for genome editing 
a decade ago, and are defined as artificial restriction enzymes 
composed of a fusion between the DNA-binding domain of a 
zinc-finger protein (ZFP) and the cleavage domain of the Fokl 
endonuclease. The DNA-binding domain of ZFPs can be engi- 
neered to recognize a variety of DNA sequences, while the Fokl 
endonuclease domain, which functions as a dimer, cleaves the 
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DNA and creates DSBs (Durai et al., 2005; Porteus and Carroll, 
2005). Through directed co-localization and dimerization of two 
Fokl nuclease monomers, ZFNs generate a functional site-specific 
endonuclease that creates a DSB at the targeted locus (Mani 
et al., 2005). Through the use of this methodology, the induced 
DNA sequence modifications can range from mutations to gene 
replacement, site-specific structural changes, or gene insertion, to 
name a few (Husaini et al., 201 1). 

DNA repair of DSBs is primarily carried out through HR and 
NHEJ. Depending on the desired modification, either pathway 
can be used in ZFN-mediated genomic engineering. Since HR 
relies on homologous DNA to repair the DSB, gene targeting 
can be achieved by supplying an exogenous template, termed a 
donor sequence, which is replicated and mostly used to intro- 
duce small mutations or large insertions. On the other hand, 
NHEJ is an error-prone repair process, ideal for generating muta- 
tions that can result in gene knockouts or knockdowns when 
the ZFN-mediated DSB is introduced into the protein coding 
sequence of a gene (Bibikova et al., 2003; Urnovet al., 2010). ZFNs 
have been successfully used for inducing DSBs in the genomes 
of various species, including plants (Lloyd et al., 2005; Wright 
et al., 2005). Successful HR-based gene replacement was observed 
at frequencies ranging from 0.2 to 4% in tobacco protoplasts, 
where acetolactase synthase genes SurA and SurB were mutated 
to confer resistance to herbicides (Townsend et al., 2009). In 
maize, Shukla et al. (2009) showed that insertional disruption of 
the IPKl gene, encoding the inositol- 1, 3,4, 5,6-pentakisphosphate 
enzyme, resulted in both herbicide tolerance and alteration of the 
inositol phosphate profile in developing seeds. In addition, the 
trait/modification was stably transmitted to the next generation. 

For the successful integration of genes at specific chromo- 
somal locations, it is of utmost importance to identify proper 
sites for gene insertion. The results of Day et al. (2000) have 
shown that a transgene can be delivered into a specific chromo- 
some position; this will allow the selection of a specific target site 
for a consistent and higher transgene expression. Therefore, the 
ability to achieve site-specific manipulation of the rice genome 
can improve the expression of transgenes as it is highly depen- 
dent on the locus of integration. These integration regions must 
possess high gene expression and preferably be localized in 
non-coding DNA regions (Curtin et al., 2012; Sadelain et al, 
2012). In the present study, ZFNs were employed in order to 
identify such regions, hereby designated as safe harbor loci, 
on rice chromosomes. Three different ZFN constructs, contain- 
ing ^-glucuronidase (GUS) as a reporter gene, were used. The 
level of gene expression in different loci was measured through 
GUS assay, while the flanking regions were determined through 
thermal-asymmetric interlaced polymerase chain reaction (TAIL 
PGR). This represents the first report on the potential use of ZFNs 
for the identification of safe harbor loci in plants. A number 
of important agronomic traits to improve rice for higher yield, 
tolerance of environmental stresses, and metabolic engineering 
are polygenic in nature. A large number of genes are needed to 
modify the metabolic pathway; the safe harbor loci wiU allow 
pyramiding of transgenes in one locus. The results presented here 
can be of great practical applicability in generating crops with 
improved agronomic traits. 



RESULTS 

GENERATION OF TRANSGENIC RICE PLANTS USING ZFN CONSTRUCTS 

Three different constructs, pZFNl, pZFN2, and pZFN3, were 
used to generate transgenic rice plants. The vector system is based 
on the assembly of ZFN expression cassettes, a plant selection 
expression cassette, and a GUS reporter cassette, onto the plant 
binary vector pRGS2. The constructs contain the hpt (hygromycin 
phosphotransferase) gene driven by the octopine synthase pro- 
moter (OcsP), while the GUS and ZFN expressions are driven 
by a heat-shock inducible promoter (hspP; GenBank Acc. No. 
NG_003076.8). The difference between pZFNl and pZFN2 con- 
sists of the length of the hpt gene. Each construct was introduced 
into A. tumefaciens LBA4404 and subsequently co-cultivated with 
rice immature embryos. In the case of pZNF3, co-transformation 
of two binary vectors, one carrying the plant selection marker 
and the reporter repair plasmid and the other carrying only 
the constitutive ZFN expression cassette, was used (Figure 1). 
Agrobacterium-mediated transformation steps are summarized in 
Figure 2. Constructs pZFNl and pZFN2 generated 171 (85.5%) 
and 133 (88.5%) calli resistant to hygromycin, while the pZNF3 
construct showed the highest number of resistant calli (439, 
146.3%). From the regenerable callus culture, 29 GUS-positive 
plants were obtained for pZFNl, 60 for pZFN2, and 188 for 
pZFN3 (Table 1). Based on the number of immature embryos 
used and the plants obtained, transformation efficiency was cal- 
culated for each construct. Results are shown in Table 1. The 
highest transformation efficiency (66.3%) was registered when 
the pZFN3 construct was used. 

GUS EXPRESSION LEVELS IN RICE TRANSGENICS 

Following Agrobacterium-mediaXed transformation, rice-positive 
transformants were identified based on histochemical GUS detec- 
tion. Two-week-old plantlets were initially incubated at 42° C for 
90-150 min in order to trigger expression of the GUS gene, which 
is driven by a hspP. Using the Image J software, pixel density is 
measured based on the blue color present in plant tissue. The 
numerical values obtained, were then categorized in three differ- 
ent levels of GUS intensity (high, medium, and low). Figure 3 
shows the histochemical GUS analysis. The number of positive 
events categorized accordingly with the levels of GUS expres- 
sion is presented in Table 2. Out of 29 positive events obtained 
using pZFNl, 8 showed high expression, while 21 events showed 
low GUS expression. In the case of pZNF2, out of 60 positive 
events, only 1 presented high GUS expression, 8 showed medium 
expression, and 51 showed low expression. The highest number 
of events with high GUS expression (113) was obtained when the 
pZFN3 construct was used. 

IDENTIFICATION OF FLANKING SEQUENCES AND SAFE HARBOR LOCI 
REGIONS 

Genomic DNA was extracted from all positive events and a 
two-step TAIL PGR was performed in order to identify the flank- 
ing sequences (Figure 4). AU GUS-positive plants with different 
levels of expression gave specific TAIL-PCR products ranging 
from 500 to 1000 bp. Bands were subsequently purified from 
gel, cloned and sequenced. Following bioinformatic analysis, 28 
sites for GUS insertion were identified (Supplementary Table 1). 
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FIGURE 1 1 Structure and key features of the pZFN1, pZFN2, £ 


nd pZFN3 for the modification generated in the cassette. The plasmid carries a plant 


constructs. The vector system is based on tine assembly of ZFN expression expression cassette engineered for constitutive expression of a mutated uidA 


cassettes (pSAT4.hspRZFN and pSATB.hspRZFN), a plant selection (GUS) gene. A stop (TGA) codon was engineered within the 6-bp spacer of 


expression cassette (pSATl.hpt), and GUS reporter cassette 


the ZFN target site, leading to premature termination of uidA translation in 


(pSAT6A.ZFN-TS*::GUS) onto the plant binary vector pRCS2. Asterisk stands plant cells. The ZFN QQR recognition site is shown. 




FIGURE 2 1 Generation of transgenic rice plants by immature embryos with Agrobacterium suspension; (C) selection of 

^grobactenum-mediated transformation. (A) Growth of resistant calli; (D) regeneration of plantlets; (E) rice plantlets on 

A. tumefaciens LBA4404 on AB medium; (B) co-cultivation of MSO rooting media. 
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FIGURE 3 I Histochemical GUS staining; (A) Wild type; (B) Transgenic 
rice exhibiting high GUS expression; (C) Transgenic rice exhibiting 
medium GUS expression; (D) Transgenic rice exhibiting low GUS 
expression. 



Table 1 | Transformation efficiency of embryogenic calli derived from 
immature embryos of IR64 rice infected with A. tumefaciens LBA4404 
containing ZFN constructs. 



Constructs 


Immature 


Resistant 


Regenerated 


GUS- Transformation 




embryos 


calli (%) 


calli (%) 


positive 


efficiency (%) 










plants 




pZFNI 


200 


85.5 


32.2 


29 


275 


pZFN2 


150 


88.7 


49.6 


60 


44.0 


pZFN3 


300 


146.3 


45.3 


188 


66.3 


However, 


some of these re\ 


;ions were 


too short 


to be con- 



sidered as safe harbor loci, while others presented low GUS 
expression. Out of the putative sites identified, three sequences 
chosen from plants exhibiting high GUS expression also pre- 
sented a proper size (Table 3). One event showed integration 
on chromosome 1 (3404275-3405012), three independent events 
presented integration on chromosome 8 (5490900-5491654), 
and four independent events were integrated on chromo- 
some 3 (8499895-8500138). When the sequences were verified 
for the presence/absence of coding genes, the BLAST results 
showed that the region on chromosome 1 is part of a gene 
(LOC_Os01g07212) encoding a putative staphylococcal nuclease 
homolog. Similarly, the region located on chromosome 8 is part 
of a gene (LOC_Os08g09480) coding for OsFBX268, an F-box 
domain-containing protein. Only the locus on chromosome 3 was 
shown to be localized in a non-coding DNA region (Table 3). 
Two putative genes, LOC_Os03gl5470 and LOC_Os03gl5480, 
are located near this region (Supplementary Figure 1), but the 
243-bp sequence on chromosome 3 (8499895-8500138) is con- 
sidered as non-coding. Since this region was identified from 
plants with high GUS expression, and no putative coding gene, 
it can be considered as a safe harbor locus for gene insertion. The 
nucleotide sequence was also converted to amino acid sequence, 
and no putative protein was shown to be encoded in this region. 

DISCUSSION 

In the present study, the ZFNs characteristic of inducing a DSB 
and subsequently trigger a response of proper DNA-repair path- 
ways, was used to successfully insert the p -glucuronidase marker 
gene into the rice genome with the purpose of identifying safe 
regions for gene integration. The originality of this work derives 
from the use of ZFNs as an innovative technique for plant genome 
editing, associated with a newly standardized protocol for rice 
Agrobacterium-mediated transformation. Furthermore, the final 
result, identification of safe harbor loci for gene insertion, has a 
great impact related to practical applications in agriculture. 

Several strategies can be used for ZFNs to modify the genome 
of plant species, depending on the presence and structure of the 
donor DNA and the plant DNA-repair machinery. Targeting a 
specific genomic sequence requires the delivery and expression 
of two ZFN monomers in the same cell, with the final goal of 
inducing site-specific mutagenesis, gene stacking, and/or gene 
replacement (Urnov et al, 2010). 

Three constructs carrying the semi-palindromic target site of 
QQR ZFN were employed to generate transgenic rice plants. 



Table 2 | Events positive for GUS integration separated into groups 
(high, medium, and low) based on the intensity of GUS staining 
expression. 



Constructs 


Total 


High GUS 


Medium GUS 


Low GUS 




events 


expression 


expression 


expression 


pZFNI 


29 


8 


0 


21 


pZFN2 


60 


1 


8 


51 


pZFN3 


188 


113 


40 


30 



QQR (Glutamine-Glutamine-Arginine) ZFN is a well-defined 
three-finger ZFN capable of recognizing and binding to 5'- 
GGGGAAGAA-3' nucleotide sequence. It was among the first 
chimera nucleases used and since then was successfully applied 
for genome engineering in both animals and plants (Smith et al., 
2000; Weinthal et al., 2013). The vectors are based on the struc- 
ture of a previously described pSAT plant expression vector 
system specifically designed to facilitate the assembly of multi- 
gene expression cassettes (Tzfira et al., 2005). The expression of 
the QQR coding sequence, as well as the expression of the GUS 
gene, is controlled by a hspR This type of construct was suc- 
cessfully used in generating targeted mutations in Arahidopsis 
plants (Lloyd et al., 2005). The authors also estimated that QQR, 
when expressed under a heat-shock promoter, can induce muta- 
tions at a frequency as high as 0.2 mutations per gene, which is 
considerably higher than the previously reported HR-dependent 
frequencies (from 10^^ to 10^*) for plant cells (lida and Terada, 
2005). 
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FIGURE 4 I Agarose gel analysis of TAIL PGR products amplified from GUS-positive insertion lines. Bands sliown in boxes were cut and sequenced. 
ADl , AD2, ADS, and AD4 = non-specific primers. 



Table 3 | Chromosomal localization and annotation of putative safe harbor loci identified in transgenic rice plants obtained by using ZFN 
constructs. 

Sample ID GUS expression Chromosome region Sequence length (bp) Gene annotation Protein 
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High 
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243 
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High 
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243 


Non-coding 


Non-coding 
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LOC_Os01g07212 
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F-box domain 
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Among the three different constructs used for plant trans- 
formation, pZNF3 showed the highest transformation efficiency 
(66.3%) as well as the highest number of GUS-positive events 
(188 plants). This finding suggests that co-transformation of two 
separate vectors (one carrying the plant selection marker and the 
reporter repair plasmid, and the other carrying only the consti- 
tutive ZFN expression cassette) is more efficient than when the 
ZFNs and donors are delivered together in the same construct. In 
addition, the pZFN2 construct possessing a shorter variant of hpt 
gene showed higher transformation efficiency (44%) than pZFNl 
(27.5%). 

One of the main goals of genetic engineering is to attain stable 
transgenic events possessing predictable and reproducible lev- 
els of expression that can be characterized in terms of the effect 



and implications of transgene insertion. In order to achieve this, 
it is important to gather detailed information about the trans- 
gene insertion site. The main attribute for safe regions to be 
used as gene integration sites is localization in non-coding DNA 
regions that permit high gene expression (Curtin et al, 2012; 
Sadelain et al., 2012). Techniques used for this purpose include 
detection of the physical position of transgenes by fluorescent 
in situ hybridization (FISH) (Salvo-Garrido et al, 2001; Choi 
et al, 2002) as well as genetic map position (Salvo-Garrido et al., 
2004). Information on transgene insertions can also be obtained 
from the analysis of flanking regions (Sha et al., 2004). To iden- 
tify the insertion site of the T-DNA flanking sequence, analyses 
were conducted using TAIL PGR. The method consists of the 
use of nested T-DNA border region-specific primers together 
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with shorter arbitrary degenerate primers for the unknown 
genomic DNA region flanking the insertion site (Liu et al., 1995). 
Such priming creates both specific and non-specific products, 
whose relative amplification efiiciencies can be thermally con- 
trolled. In two serial PCRs, the unspecific products are gradually 
diluted out and in the final reaction the specific products are 
detectable on the gel by a slight shift in size due to the nested 
priming in the T-DNA region. In the present study, 28 flank- 
ing sequences for the GUS gene were identified fijUowing rice 
transformation with ZFN constructs. However, most of these 
sites possessed low expression and/or were localized in DNA 
coding regions. Only one region, located on chromosome 3 
(8499895-8500138), retained both attributes and it can be con- 
sidered as a safe harbor locus for gene insertion. This finding 
is highly important for future practical applications that could 
lead to the creation of crops with new valuable traits. In addi- 
tion, ZFNs as part of new plant breeding technologies allow 
genome editing without the introduction of foreign DNA; thus, 
the resulting crops could be classified as non-GMO (Pauwels 
etal, 2014). 

MATERIALS AND METHODS 
VECTOR DESIGN 

The constructs used in the present study were assembled and 
validated as per Tovkach et al. (2010). A schematic repre- 
sentation of plasmid maps is shown in Figure 1. The vector 
system is based on the assembly of ZFN expression cassettes 
(based on pSAT4.hspP.ZNF and pSATS.hspPZNF plasmids), a 
plant selection expression cassette (based on pSATl.hpt plas- 
mid), and a GUS reporter cassette (based on pSAT6A.ZFN- 
TS*::GUS plasmid) onto the plant binary vector pRCS2. The 
nucleotide sequences of pSATl.hpt, pSAT6A.ZFN-TS*::GUS, 
pSAT4.hspP.ZNF, and pSATS.hspP.ZNF plasmids are available 
on the NCBI site. The plant selection cassette contains the hpt 
gene, which confers resistance to hygromycin and is driven by 
the OcsP. The GUS and ZFN expressions are driven by a hspP 
(GenBank Acc. No. NC_003076.8). The GUS reporter cassette 
contains a mutated GUS reporter gene, engineered to carry 
a TGA (stop) codon within the 6-bp spacer of the ZFN tar- 
get site, constructed on a reporter repair plasmid. The QQR 
ZFN recognition site is 5'-GGGGAAGAA-3'. Three different con- 
structs were used and they were designated as pZFNl, pZFN2, 
and pZFN3 (Figure 1). The difference between pZFNl and 
pZFN2 consists in the length of the hpt gene. As for pZNF3, 
it requires co-transformation of two binary vectors: one carry- 
ing the plant selection marker and the reporter repair plasmid 
and the other carrying only the constitutive ZFN expression 
cassette. 

/l6/?0e4Crf/?/(/M-MEDIATED TRANSFORMATION AND PLANT 
REGENERATION 

Agrobacterium-mediated transformation of IR64 rice (Oryza 
sativa L. indica) was performed as described by Slamet-Loedin 
et al. (2014). Immature embryos (IE) harvested from rice pan- 
icles at 8-12 days after anthesis were dehuUed and sterilized. 
IE were co-cultivated with A. tumefaciens LBA4404 for 7 days 
at 25°C under dark conditions in A201 medium. Following 



co-cultivation, IE were transferred onto sterile Petri dishes con- 
taining A202 selection medium complemented with hygromycin 
and incubated under continuous light at 30°C for 5 days (first 
selection). Subsequently, lEs were placed on A203 medium and 
incubated under the same conditions for 10 days (second selec- 
tion). After this period, the embryogenic calli were selected and 
placed on A203 medium under the same conditions and time 
period (third selection). 

For plant regeneration, the resistant calli were incubated on 
pre-regeneration medium (A204) under continuous light at 30°C 
for 10 days. Proliferating calli were selected and transferred onto 
regeneration medium (A205). Subsequently, individual regener- 
ated plantlets were placed in sterile glass tubes containing MSG 
rooting medium and kept under continuous light at 25° C for 14 
days. Plant material was then used for further analysis. 

HISTOCHEMICAL GUS ASSAY 

The location of GUS activity in plant tissues was determined 
histochemically as described by lefferson et al. (1987). The 
GUS reaction mix consisted of the following: 50 mmol/L potas- 
sium ferrocyanide, 50 mmol/L potassium ferricyanide, 5 mL 
0.2 mol/L sodium phosphate buffer, 0.5 mol/L sodium EDTA and 
10% Triton X-100, and water. A separate solution of X-Gluc 
(5-Bromo-4-Chloro-3-Indolyl-Beta-D-Glucuronide) (Biosynth, 
Switzerland) at a concentration of 25 mg X-Gluc/mL of N-N 
dimethyl formamide was added to this reaction mix at a ratio of 
352 |xL of reaction mix to 48 \iL of X-Gluc solution. In order to 
trigger the expression of the GUS gene, 2-week-old rice plantlets 
were incubated for 90-150 min at 42°C and subsequently recov- 
ered for an additional 24-72 h prior to GUS staining. Plantlets 
were then incubated in the GUS solution at 37°C for 16 h. The 
GUS solution was discarded and the plantlets were rinsed with 
water and bleached sequentially with 25, 50, and 75% ethyl alco- 
hol and finally kept in 95% ethyl alcohol. Quantification of GUS 
activity was performed by using ImageJ (v. 1.45) software to 
identify the intensity of GUS gene expression and accordingly 
the transgenics were labeled as having high, medium, and low 
expression. 

GENOMIC DNA EXTRACTION 

Rice genomic DNA was extracted and purified following the 
protocol described by Dellaporta et al. (1983) with some modi- 
fication in the extraction buffer as follows: 1 M Tris-HCl (pH = 
8.0), 0.5 M EDTA, and 4 M NaCl and sodium bisulfate. Following 
extraction, genomic DNA was measured spectrophotometrically 
(NanoDrop ND-1000, NanoDrop, USA) by UV absorption at 
260 nm, while DNA purity was evaluated on the basis of the 
UV absorption ratio at 260/280 nm and analyzed by 1% agarose 
gel electrophoresis in 1 x TAE SYBR SAFE® (Invitrogen, USA) 
staining. 

THERMAL-ASYMMETRIC INTERLACED POLYMERASE CHAIN 
REACTION (TAIL PCR) 

TAIL PCR was performed using the 5-Prime Taq DNA poly- 
merase (5-Prime, USA) in a G-STORM® PCR System (Somerton 
Biotechnology Centre, UK) as per the supplier's recommenda- 
tion. Genomic DNA isolated from plants with different levels 
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of GUS expression was subjected to two separate PGR runs. 
Primer sequences are shown in Supplementary Table 2. The 
reaction mix for the primary amplification was prepared in a 
total volume of 20 \iL and contained 1 x 5-Prime Taq DNA 
polymerase buffer, 1 U 5-Prime Taq DNA polymerase, 0.2 mM 
dNTPs, 0.2 |aM specific left border primer (LB_pRGS2_Fl), and 
3 (jlM of each arbitrary degenerate primer (ADl, AD2, AD3, 
and AD4). Each arbitrary degenerate primer was paired to the 
specific left border primer, resulting in four primer pair reac- 
tions for each sample. The PGR program for the primary TAIL 
PGR included 5 cycles at 94°G for 1 min, 55°G for 1 min, and 
72°G for 2.3 min, 1 cycle at 94°G for 30 s, 44 °G for 1 min, and 
72°G for 2.3 min, 15 cycles at 94°G for 30 s, 55°G for 1 min, 
and 72°G for 2.3 min, and the last cycle at 94°G for 30 s, 44°G 
for 1 min, and 72° G for 2.3 min, with a final elongation step 
at 72° G for 5 min. After the primary TAIL PGR, amplification 
products were diluted 10 x for the secondary amplification. The 
reaction mixture contained the same components as the pri- 
mary PGR, except for a different specific left border primer 
(LB_pRGS2_F2). The program included 20 cycles at 94°G for 30 s, 
55°G for 1 min, 72°G for 2.3 min, 94°G for 30 s, 44°G for 1 min, 
and 72° G for 2.3 min, followed by a final elongation step at 72° G 
for 10 min. 

All TAIL PGR products were visualized on a 1.5% agarose 
gel. PGR products were excised and purified using a QIAquick 
gel extraction kit (Qiagen, USA). The purified products were 
ligated in a pGEM T-easy vector system (Invitrogen, USA) and 
transformed in E. coli DH5a strain using the heat-shock method. 
Following bacterial transformation, the white/blue screening 
method was used to identify the positive colonies, which were 
then grown in LB liquid medium overnight at 37° G. Plasmid 
DNA was extracted using a Purelink® Quick Plasmid Miniprep kit 
(Invitrogen, USA) and digested using EcoRI restriction enzyme 
(Invitrogen, USA) to check for the presence of the ligated PGR 
product. Subsequently, plasmids were sequenced and analyzed 
with bioinformatic tools (Macrogen, Korea). 

BIOINFORMATIC ANALYSIS 

NGBI BLASTn (http://blast.ncbi.nlm.nih.gov/Blast.cgi) was used 
to locate the sequenced regions on the corresponding chromo- 
somes. Sequences were checked using the rice annotation sites 
MSU (http://rice.plantbiology.msu.edu), RAP-DB (http://rapdb. 
dna.affrc.go.jp/), and Gramene (www.gramene.org/). Nucleotide 
sequences were transcribed to proteins using Expasy Tools (http:// 
web.expasy.org/translate/) and the putative protein sequences 
were verified using the UniProt database (www.uniprot.org/ 
uniprot/). 
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